Markov Decision Processes with Average-Value-at-Risk criteria
نویسندگان
چکیده
P We investigate the problem of minimizing the Average-Value-at-Risk (AV aRτ ) of the discounted cost over a finite and an infinite horizon which is generated by a Markov Decision Process (MDP). We show that this problem can be reduced to an ordinary MDP with extended state space and give conditions under which an optimal policy exists. We also give a time-consistent interpretation of the AV aRτ . At the end we consider a numerical example which is a simple repeated casino game. It is used to discuss the influence of the risk aversion parameter τ of the AV aRτ -criterion.
منابع مشابه
Conditional Value-at-Risk for Random Immediate Reward Variables in Markov Decision Processes
We consider risk minimization problems for Markov decision processes. From a standpoint of making the risk of random reward variable at each time as small as possible, a risk measure is introduced using conditional value-at-risk for random immediate reward variables in Markov decision processes, under whose risk measure criteria the risk-optimal policies are characterized by the optimality equa...
متن کاملControlled Markov Processes with AVaR Criteria for Unbounded Costs
In this paper, we consider the control problem with the Average-Value-at-Risk (AVaR) criteria of the possibly unbounded L1-costs in infinite horizon on a Markov Decision Process (MDP). With a suitable state aggregation and by choosing a priori a global variable s heuristically, we show that there exist optimal policies for the infinite horizon problem. Mathematics Subject Classification: 90C39,...
متن کاملControlled Markov Decision Processes with AVaR criteria for unbounded costs
In this paper, we consider the control problem with the Average-Value-at-Risk (AVaR) criteria of the possibly unbounded L1-costs in infinite horizon on a Markov Decision Process (MDP). With a suitable state aggregation and by choosing a priori a global variable s heuristically, we show that there exist optimal policies for the infinite horizon problem for possibly unbounded costs. Mathematics S...
متن کاملRisk-Sensitive and Average Optimality in Markov Decision Processes
Abstract. This contribution is devoted to the risk-sensitive optimality criteria in finite state Markov Decision Processes. At first, we rederive necessary and sufficient conditions for average optimality of (classical) risk-neutral unichain models. This approach is then extended to the risk-sensitive case, i.e., when expectation of the stream of one-stage costs (or rewards) generated by a Mark...
متن کاملDeveloping a model for simulating urban expansion based on the concept of decision risk: A case study in Babol city
Today, the study of the spatial-temporal pattern of urban physical expansion and the identification of the parameters affecting the expansion play a crucial role in urban-related decision-making and long-term planning processes. Consequently, the use of precise and efficient methods to predict the physical expansion of urban areas is of great importance. The objective of present study is to pro...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Math. Meth. of OR
دوره 74 شماره
صفحات -
تاریخ انتشار 2011